Prewarm LLM cache #6692

akatsoulas · 2025-05-30T09:53:04Z

No description provided.

akatsoulas · 2025-05-30T09:53:39Z

kitsune/llm/utils.py


-@cache
+
+@lru_cache(maxsize=1)


functools.cache is equivalent to lru_cache(maxsize=None), so it never evicts entries.
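
For readers unfamiliar with the two decorators, here is a minimal sketch of the difference this comment refers to. The functions `load_unbounded` and `load_bounded` are hypothetical stand-ins, not the function decorated in kitsune/llm/utils.py; they only count how often the underlying body runs.

```python
from functools import cache, lru_cache

# Count how many times each (hypothetical) loader body actually executes.
calls = {"unbounded": 0, "bounded": 0}


@cache  # same behaviour as lru_cache(maxsize=None): every distinct argument is kept
def load_unbounded(name):
    calls["unbounded"] += 1
    return object()


@lru_cache(maxsize=1)  # keeps only the most recently used result
def load_bounded(name):
    calls["bounded"] += 1
    return object()


# Alternate between two arguments: the bounded cache evicts "a" when "b" arrives.
for name in ("a", "b", "a"):
    load_unbounded(name)
    load_bounded(name)

unbounded_size = load_unbounded.cache_info().currsize
bounded_size = load_bounded.cache_info().currsize
```

With a single fixed argument set (or none at all), both decorators behave the same in practice; the bound only matters once more than one distinct call signature is used.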

escattone

❤️ this! I added a commit to limit when the LLM cache is pre-warmed.

escattone · 2025-05-30T15:34:06Z

kitsune/settings.py

@@ -1337,8 +1337,6 @@ def filter_exceptions(event, hint):

 USER_INACTIVITY_DAYS = config("USER_INACTIVITY_DAYS", default=1095, cast=int)

-if DEV:
-    GOOGLE_APPLICATION_CREDENTIALS = config("GOOGLE_APPLICATION_CREDENTIALS", default="")


GOOGLE_APPLICATION_CREDENTIALS is only needed for local development, and even then it doesn't have to be defined in settings. It's never used within the dev, stage, or prod environments.

escattone · 2025-05-30T20:31:15Z

I love the idea of this, but it re-introduces the slow start-up issue that caused our deployment problems. I'm going to revert this.

I don't think we need this anyway, because we're not really concerned with the initial startup cost when making the first LLM call.

This reverts commit c055b04.
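
The trade-off described in this comment can be sketched roughly as follows. This is an illustration under stated assumptions, not the reverted code: `get_llm` is a hypothetical stand-in for an expensive client constructor, and `time.sleep` merely simulates its initialization cost.

```python
import time
from functools import lru_cache


@lru_cache(maxsize=1)
def get_llm():
    """Hypothetical stand-in for constructing an expensive LLM client."""
    time.sleep(0.05)  # simulate slow initialization
    return {"client": "ready"}


def startup(prewarm: bool) -> float:
    """Return how long a simulated start-up takes, with or without pre-warming."""
    get_llm.cache_clear()  # start from a cold cache each time
    start = time.perf_counter()
    if prewarm:
        get_llm()  # initialization cost is paid during start-up
    return time.perf_counter() - start


eager = startup(prewarm=True)
lazy = startup(prewarm=False)

first_call = get_llm()   # lazy path: the cost is paid on the first real call
second_call = get_llm()  # served from the cache
```

Without pre-warming, start-up stays fast and the one-time cost moves to the first LLM call, which is the behaviour the revert restores.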

Prewarm LLM cache

0fb9d63

akatsoulas requested a review from escattone May 30, 2025 09:53

akatsoulas commented May 30, 2025

escattone approved these changes May 30, 2025

pre-warm LLM cache only if project defined

6ceafcd

escattone reviewed May 30, 2025

escattone merged commit c055b04 into mozilla:main May 30, 2025
2 checks passed

escattone added a commit that referenced this pull request May 30, 2025

Revert "Prewarm LLM cache (#6692)"

0bf2a89

This reverts commit c055b04.

escattone mentioned this pull request May 30, 2025

Revert "Prewarm LLM cache" #6694

Merged

escattone added a commit that referenced this pull request May 30, 2025

Revert "Prewarm LLM cache (#6692)" (#6694)

ab1cdd3

This reverts commit c055b04.
